Probabilistic Dense Depth from Monocular Images

نویسنده

  • Adel H. Fakih
چکیده

T his proposal presents a probabilistic formalism that strives for a real time estimation of depth and 3D motion from monocular images. Given a moving camera equipped with an inertial measurement unit and capturing images of a scene with moving objects, we would like to determine, in real time, the 3D structure of the scene in the field of view of the camera as well as the different 3D motions with respect to the camera. Since the structure we are seeking is dense and the frame rate is high it is better to use optical flow rather than point correspondences. This problem has been the subject of thorough study and research over the past two decades; however, in terms of time efficiency, accuracy and ability to deal with diverse situations, the results achieved by various approaches do not meet the requirements of real-life applications. The problem presents many inherent difficulties due mainly to the noise induced by the imaging process, to the optical flow being an approximation of the velocity field and to the sensitivity of the problem to noise implied by the underlying mathematics. Recursive estimation has been always known as a viable framework for this problem. However due to high dimensionality of the depth field, recursive estimation is problematic. Almost all of the previous approaches assume a known motion or estimate depth and motion independently which allows them to do a pixel wise estimation of depth. The proposed formalism is tailored to address these specific problems and rests on three cornerstones. (1) The first one is a Rao-Blackwellised Particle Filter which partitions the state-space into two subspaces, a 3D velocity subspace estimated recursively using a Particle Filter, and a depth subspace conditional on the velocity subspace. The depth field of each particles is cast as a Markov Random Field and estimated in two steps (second and third cornerstones). (2) An Extended Kalman Filter is employed to recursively estimate the depth at each point using optical flow likelihoods. (3) A recursive approach is proposed to segment the depth field and is followed by a probabilistic depth regularization within the segments using belief propagation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CodeSLAM - Learning a Compact, Optimisable Representation for Dense Visual SLAM

The representation of geometry in real-time 3D perception systems continues to be a critical research issue. Dense maps capture complete surface shape and can be augmented with semantic labels, but their high dimensionality makes them computationally costly to store and process, and unsuitable for rigorous probabilistic inference. Sparse feature-based representations avoid these problems, but c...

متن کامل

Learning Deeply Supervised Visual Descriptors for Dense Monocular Reconstruction

Visual SLAM (Simultaneous Localization and Mapping) methods typically rely on handcrafted visual features or raw RGB values for establishing correspondences between images. These features, while suitable for sparse mapping, often lead to ambiguous matches at texture-less regions when performing dense reconstruction due to the aperture problem. In this work, we explore the use of learned feature...

متن کامل

Learning monocular visual odometry with dense 3D mapping from dense 3D flow

This paper introduces a fully deep learning approach to monocular SLAM, which can perform simultaneous localization using a neural network for learning visual odometry (L-VO) and dense 3D mapping. Dense 2D flow and a depth image are generated from monocular images by sub-networks, which are then used by a 3D flow associated layer in the L-VO network to generate dense 3D flow. Given this 3D flow...

متن کامل

Probabilistic Semi-Dense Mapping from Highly Accurate Feature-Based Monocular SLAM

In the last years several direct (i.e. featureless) monocular SLAM approaches have appeared showing impressive semi-dense or dense scene reconstructions. These works have questioned the need of features, in which consolidated SLAM techniques of the last decade were based. In this paper we present a novel feature-based monocular SLAM system that is more robust, gives more accurate camera poses, ...

متن کامل

Bayesian depth estimation from monocular natural images.

Estimating an accurate and naturalistic dense depth map from a single monocular photographic image is a difficult problem. Nevertheless, human observers have little difficulty understanding the depth structure implied by photographs. Two-dimensional (2D) images of the real-world environment contain significant statistical information regarding the three-dimensional (3D) structure of the world t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007